Using constituency and dependency parse features to identify errorful words in disordered language

Authors

  • Eric Morley
  • Emily Tucker Prud'hommeaux
Abstract

Delayed or disordered language is a characteristic of both autism spectrum disorder (ASD) and specific language impairment (SLI). In this paper, we describe our data set, which consists of transcribed data from a widely used clinical diagnostic instrument (the ADOS) for children with ASD and children with SLI. These transcripts are manually annotated with SALT, an annotation system that applies a descriptive code to errorful words. Here we address a step in automating SALT annotation: identifying the errorful words in sentences that are known to contain an error. We propose a set of baseline features to identify errorful words, and investigate the effectiveness of adding features extracted from dependency and constituency parses. We find that features from both types of parses improve classifier performance above our baseline, both individually and in aggregate.
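
The abstract does not list the actual features, so the following is only a minimal sketch of what per-word feature extraction of this kind might look like. The `Token` fields, the feature names, and the example sentence are all hypothetical and are not taken from the paper. The sketch combines a few surface (baseline-style) features with features read off a dependency parse and a constituency parse.

```python
from dataclasses import dataclass
from typing import Dict, List


@dataclass
class Token:
    """One word with hypothetical parse annotations (not the paper's schema)."""
    word: str
    pos: str      # part-of-speech tag
    head: int     # index of the dependency head, -1 for the root
    deprel: str   # dependency relation label
    phrase: str   # label of the lowest constituent above the POS tag


def dep_depth(tokens: List[Token], i: int) -> int:
    """Number of dependency arcs between token i and the root."""
    depth = 0
    seen = set()
    while tokens[i].head != -1 and i not in seen:  # guard against cycles
        seen.add(i)
        i = tokens[i].head
        depth += 1
    return depth


def word_features(tokens: List[Token], i: int) -> Dict[str, str]:
    """Build a feature dictionary for token i (all features are assumptions)."""
    tok = tokens[i]
    prev_word = tokens[i - 1].word if i > 0 else "<s>"
    next_word = tokens[i + 1].word if i + 1 < len(tokens) else "</s>"

    # Surface features that need no parse.
    feats = {
        "word": tok.word.lower(),
        "pos": tok.pos,
        "prev_word": prev_word.lower(),
        "next_word": next_word.lower(),
    }

    # Features from the dependency parse.
    feats["deprel"] = tok.deprel
    feats["head_pos"] = tokens[tok.head].pos if tok.head != -1 else "ROOT"
    feats["dep_depth"] = str(dep_depth(tokens, i))

    # Feature from the constituency parse.
    feats["phrase"] = tok.phrase
    return feats


# Hypothetical utterance with an errorful pronoun ("him" for "he").
sentence = [
    Token("him", "PRP", 1, "nsubj", "NP"),
    Token("go", "VB", -1, "root", "VP"),
    Token("to", "IN", 3, "case", "PP"),
    Token("school", "NN", 1, "obl", "NP"),
]

features = [word_features(sentence, i) for i in range(len(sentence))]
```

Each dictionary in `features` could then be passed to any standard classifier that labels a word as errorful or not. Which classifier and which features the authors actually used cannot be determined from the abstract alone.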

Similar resources

Automatic Conversion of a Persian Dependency Treebank to a Constituency Treebank

There are two major types of treebanks: dependency-based and constituency-based. Both have applications in natural language processing and computational linguistics. Several dependency treebanks have been developed for Persian. However, no large constituency treebank is available for this language. In this paper, we aim to propose an algorithm for automatic conversion of a depe...

A Comparison of Alternative Parse Tree Paths for Labeling Semantic Roles

The integration of sophisticated inference-based techniques into natural language processing applications first requires a reliable method of encoding the predicate-argument structure of the propositional content of text. Recent statistical approaches to automated predicate-argument annotation have utilized parse tree paths as predictive features, which encode the path between a verb predicate a...

Tree Representations for Chinese Semantic Role Labeling

We compare different parse tree representations for the task of Chinese Semantic Role Labeling (SRL), including dependency and constituency parse trees, two tree pruning methods, and neighbor features. Three learning models are compared. By using an SVM classifier with neighbor features and pruning the tree to the phrase level, we achieve significantly better speed and accuracy than state of the art Chines...

Convolution Kernels for Subjectivity Detection

In this paper, we explore different linguistic structures encoded as convolution kernels for the detection of subjective expressions. The advantage of convolution kernels is that complex structures can be directly provided to a classifier without deriving explicit features. The feature design for the detection of subjective expressions is fairly difficult and there currently exists no commonly ...

Semantic Role Labeling Using Dependency Trees

In this paper, a novel semantic role labeler based on dependency trees is developed. This is accomplished by formulating semantic role labeling as the classification of dependency relations into one of several semantic roles. A dependency tree is created from a constituency parse of an input sentence. The dependency tree is then linearized into a sequence of dependency relations. A nu...

Publication date: 2012